Handling Disagreement in Hate Speech Modelling


Abstract

Hate speech annotation for training machine learning models is an inherently ambiguous and subjective task. In this paper, we adopt a perspectivist approach to data annotation, model training and evaluation for hate speech classification. We first focus on the annotation process and argue that it drastically influences the final data quality. We then present three large hate speech datasets that incorporate annotator disagreement and use them to train and evaluate machine learning models. As the main point, we propose to evaluate machine learning models through the lens of disagreement by applying proper performance measures to evaluate both annotators’ agreement and models’ quality. We further argue that annotator disagreement poses intrinsic limits to the performance achievable by models. When comparing models and annotators, we observed that they achieve consistent levels of agreement across datasets. We reflect upon our results and propose some methodological and ethical considerations that can stimulate the ongoing discussion on hate speech modelling and classification with disagreement.


Similar resources

Hate Me, Hate Me Not: Hate Speech Detection on Facebook

While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical viol...


Hate speech, volition, and neurology

In ‘A Hypothetical Neurological Association between Dehumanization and Human Rights Abuse’, Gail Murrow and Richard Murrow posit a biological explanation of how hate speech can spur violence, not only among individuals but even on a societal scale. They elaborate historical examples, cite to neuronal studies on patterns of responses in observation of pain and suffering to explain the dehumanization...


Challenges in discriminating profanity from hate speech

In this study we approach the problem of distinguishing general profanity from hate speech in social media, something which has not been widely considered. Using a new dataset annotated specifically for this task, we employ supervised classification along with a set of features that includes n-grams, skip-grams and clustering-based word representations. We apply approaches based on single class...


Detecting Hate Speech in Social Media

In this paper we examine methods to detect hate speech in social media, while distinguishing this from general profanity. We aim to establish lexical baselines for this task by applying supervised classification methods using a recently released dataset annotated for this purpose. As features, our system uses character n-grams, word n-grams and word skip-grams. We obtain results of 78% accuracy...


Automatic Detection of Online Jihadist Hate Speech

We have developed a system that automatically detects online jihadist hate speech with over 80% accuracy, by using techniques from Natural Language Processing and Machine Learning. The system is trained on a corpus of 45,000 subversive Twitter messages collected from October 2014 to December 2016. We present a qualitative and quantitative analysis of the jihadist rhetoric in the corpus, examine...



Journal

Journal title: Communications in Computer and Information Science

Year: 2022

ISSN: 1865-0937, 1865-0929

DOI: https://doi.org/10.1007/978-3-031-08974-9_54